Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

DocMining: A Document Analysis System Builder

Identifieur interne : 006C13 ( Main/Exploration ); précédent : 006C12; suivant : 006C14

DocMining: A Document Analysis System Builder

Auteurs : Sébastien Adam [France] ; Maurizio Rigamonti [Suisse] ; Eric Clavier [France] ; Éric Trupin [France] ; Jean-Marc Ogier [France] ; Karl Tombre [France] ; Joël Gardes [France]

Source :

RBID : ISTEX:2121813173D01AB1839006BADA8C546C699215D5

Descripteurs français

English descriptors

Abstract

Abstract: In this paper, we present DocMining, a general framework that allows the construction of scenarios dedicated to document image processing. The framework is the result of the collaboration between four academic partners and one industrial partner. The main issues of DocMining are the description and the execution of document analysis scenarios. The explicit declaration of scenarios and the plug-ins oriented approach of the framework allow to integrate easily new Document Processing Units and to create new application prototypes. Moreover, this paper highlights the interest of the platform to solve the problem of performance evaluation.

Url:
DOI: 10.1007/978-3-540-28640-0_45


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">DocMining: A Document Analysis System Builder</title>
<author>
<name sortKey="Adam, Sebastien" sort="Adam, Sebastien" uniqKey="Adam S" first="Sébastien" last="Adam">Sébastien Adam</name>
</author>
<author>
<name sortKey="Rigamonti, Maurizio" sort="Rigamonti, Maurizio" uniqKey="Rigamonti M" first="Maurizio" last="Rigamonti">Maurizio Rigamonti</name>
</author>
<author>
<name sortKey="Clavier, Eric" sort="Clavier, Eric" uniqKey="Clavier E" first="Eric" last="Clavier">Eric Clavier</name>
</author>
<author>
<name sortKey="Trupin, Eric" sort="Trupin, Eric" uniqKey="Trupin E" first="Éric" last="Trupin">Éric Trupin</name>
</author>
<author>
<name sortKey="Ogier, Jean Marc" sort="Ogier, Jean Marc" uniqKey="Ogier J" first="Jean-Marc" last="Ogier">Jean-Marc Ogier</name>
</author>
<author>
<name sortKey="Tombre, Karl" sort="Tombre, Karl" uniqKey="Tombre K" first="Karl" last="Tombre">Karl Tombre</name>
</author>
<author>
<name sortKey="Gardes, Joel" sort="Gardes, Joel" uniqKey="Gardes J" first="Joël" last="Gardes">Joël Gardes</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:2121813173D01AB1839006BADA8C546C699215D5</idno>
<date when="2004" year="2004">2004</date>
<idno type="doi">10.1007/978-3-540-28640-0_45</idno>
<idno type="url">https://api.istex.fr/ark:/67375/HCB-0PK8G568-2/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000748</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">000748</idno>
<idno type="wicri:Area/Istex/Curation">000743</idno>
<idno type="wicri:Area/Istex/Checkpoint">001822</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">001822</idno>
<idno type="wicri:doubleKey">0302-9743:2004:Adam S:docmining:a:document</idno>
<idno type="wicri:Area/Main/Merge">006F17</idno>
<idno type="wicri:source">INIST</idno>
<idno type="RBID">Pascal:04-0536194</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000618</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000423</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000611</idno>
<idno type="wicri:explorRef" wicri:stream="PascalFrancis" wicri:step="Checkpoint">000611</idno>
<idno type="wicri:doubleKey">0302-9743:2004:Adam S:docmining:a:document</idno>
<idno type="wicri:Area/Main/Merge">007046</idno>
<idno type="wicri:Area/Main/Curation">006C13</idno>
<idno type="wicri:Area/Main/Exploration">006C13</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">DocMining: A Document Analysis System Builder</title>
<author>
<name sortKey="Adam, Sebastien" sort="Adam, Sebastien" uniqKey="Adam S" first="Sébastien" last="Adam">Sébastien Adam</name>
<affiliation wicri:level="4">
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire PSI – CNRS FRE 2645, Université de Rouen, Place Emile Blondel, 76821, Mont Saint Aignan CEDEX</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
<settlement type="city">Mont Saint Aignan</settlement>
</placeName>
<orgName type="university">Université de Rouen</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
<author>
<name sortKey="Rigamonti, Maurizio" sort="Rigamonti, Maurizio" uniqKey="Rigamonti M" first="Maurizio" last="Rigamonti">Maurizio Rigamonti</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Suisse</country>
<wicri:regionArea>DIVA Group, DIUF, Université de Fribourg, Ch. Du Musée 3, 1700, Fribourg</wicri:regionArea>
<wicri:noRegion>Fribourg</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Suisse</country>
</affiliation>
</author>
<author>
<name sortKey="Clavier, Eric" sort="Clavier, Eric" uniqKey="Clavier E" first="Eric" last="Clavier">Eric Clavier</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>France Telecom R&D, 2 Avenue Pierre Marzin, 22307, Lannion CEDEX</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Région Bretagne</region>
<settlement type="city">Lannion</settlement>
</placeName>
</affiliation>
<affiliation></affiliation>
</author>
<author>
<name sortKey="Trupin, Eric" sort="Trupin, Eric" uniqKey="Trupin E" first="Éric" last="Trupin">Éric Trupin</name>
<affiliation wicri:level="4">
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire PSI – CNRS FRE 2645, Université de Rouen, Place Emile Blondel, 76821, Mont Saint Aignan CEDEX</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
<settlement type="city">Mont Saint Aignan</settlement>
</placeName>
<orgName type="university">Université de Rouen</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
<author>
<name sortKey="Ogier, Jean Marc" sort="Ogier, Jean Marc" uniqKey="Ogier J" first="Jean-Marc" last="Ogier">Jean-Marc Ogier</name>
<affiliation wicri:level="4">
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire L3i, Université de la Rochelle, Avenue Michel Crépeau, 17042, La Rochelle CEDEX</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Nouvelle-Aquitaine</region>
<region type="old region" nuts="2">Poitou-Charentes</region>
<settlement type="city">La Rochelle</settlement>
</placeName>
<orgName type="university">Université de La Rochelle</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
<author>
<name sortKey="Tombre, Karl" sort="Tombre, Karl" uniqKey="Tombre K" first="Karl" last="Tombre">Karl Tombre</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>LORIA, INRIA, B.P.239, 54506, Vandoeuvre-lès-Nancy CEDEX</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
<author>
<name sortKey="Gardes, Joel" sort="Gardes, Joel" uniqKey="Gardes J" first="Joël" last="Gardes">Joël Gardes</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>France Telecom R&D, 2 Avenue Pierre Marzin, 22307, Lannion CEDEX</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Région Bretagne</region>
<settlement type="city">Lannion</settlement>
</placeName>
</affiliation>
<affiliation></affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s" type="main" xml:lang="en">Lecture Notes in Computer Science</title>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Data analysis</term>
<term>Declaration</term>
<term>Document analysis</term>
<term>Document processing</term>
<term>Document structure</term>
<term>Image processing</term>
<term>Performance evaluation</term>
<term>Script</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Analyse documentaire</term>
<term>Analyse donnée</term>
<term>Evaluation performance</term>
<term>Instruction déclaration</term>
<term>Scénario</term>
<term>Structure document</term>
<term>Traitement document</term>
<term>Traitement image</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: In this paper, we present DocMining, a general framework that allows the construction of scenarios dedicated to document image processing. The framework is the result of the collaboration between four academic partners and one industrial partner. The main issues of DocMining are the description and the execution of document analysis scenarios. The explicit declaration of scenarios and the plug-ins oriented approach of the framework allow to integrate easily new Document Processing Units and to create new application prototypes. Moreover, this paper highlights the interest of the platform to solve the problem of performance evaluation.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
<li>Suisse</li>
</country>
<region>
<li>Grand Est</li>
<li>Haute-Normandie</li>
<li>Lorraine (région)</li>
<li>Nouvelle-Aquitaine</li>
<li>Poitou-Charentes</li>
<li>Région Bretagne</li>
<li>Région Normandie</li>
</region>
<settlement>
<li>La Rochelle</li>
<li>Lannion</li>
<li>Mont Saint Aignan</li>
<li>Vandœuvre-lès-Nancy</li>
</settlement>
<orgName>
<li>Université de La Rochelle</li>
<li>Université de Rouen</li>
</orgName>
</list>
<tree>
<country name="France">
<region name="Région Normandie">
<name sortKey="Adam, Sebastien" sort="Adam, Sebastien" uniqKey="Adam S" first="Sébastien" last="Adam">Sébastien Adam</name>
</region>
<name sortKey="Adam, Sebastien" sort="Adam, Sebastien" uniqKey="Adam S" first="Sébastien" last="Adam">Sébastien Adam</name>
<name sortKey="Clavier, Eric" sort="Clavier, Eric" uniqKey="Clavier E" first="Eric" last="Clavier">Eric Clavier</name>
<name sortKey="Gardes, Joel" sort="Gardes, Joel" uniqKey="Gardes J" first="Joël" last="Gardes">Joël Gardes</name>
<name sortKey="Ogier, Jean Marc" sort="Ogier, Jean Marc" uniqKey="Ogier J" first="Jean-Marc" last="Ogier">Jean-Marc Ogier</name>
<name sortKey="Ogier, Jean Marc" sort="Ogier, Jean Marc" uniqKey="Ogier J" first="Jean-Marc" last="Ogier">Jean-Marc Ogier</name>
<name sortKey="Tombre, Karl" sort="Tombre, Karl" uniqKey="Tombre K" first="Karl" last="Tombre">Karl Tombre</name>
<name sortKey="Tombre, Karl" sort="Tombre, Karl" uniqKey="Tombre K" first="Karl" last="Tombre">Karl Tombre</name>
<name sortKey="Trupin, Eric" sort="Trupin, Eric" uniqKey="Trupin E" first="Éric" last="Trupin">Éric Trupin</name>
<name sortKey="Trupin, Eric" sort="Trupin, Eric" uniqKey="Trupin E" first="Éric" last="Trupin">Éric Trupin</name>
</country>
<country name="Suisse">
<noRegion>
<name sortKey="Rigamonti, Maurizio" sort="Rigamonti, Maurizio" uniqKey="Rigamonti M" first="Maurizio" last="Rigamonti">Maurizio Rigamonti</name>
</noRegion>
<name sortKey="Rigamonti, Maurizio" sort="Rigamonti, Maurizio" uniqKey="Rigamonti M" first="Maurizio" last="Rigamonti">Maurizio Rigamonti</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 006C13 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 006C13 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:2121813173D01AB1839006BADA8C546C699215D5
   |texte=   DocMining: A Document Analysis System Builder
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022